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Abstract. Starting from the full-shift on a finite alphabet A, mingling some 
symbols of A, we obtain a new full shift on a smaller alphabet B. This amal- 
gamation defines a factor map from (A^,Ta) to (B N ,Ts), where Tj\ and Tb 
are the respective shift maps. 

According to the thermodynamic formalism, to each regular function ('poten- 
tial') ip : A N — > R, we can associate a unique Gibbs measure fi^. In this article, 
we prove that, for a large class of potentials, the pushforward measure (i^on -1 
is still Gibbsian for a potential <j> : — ► K having a 'bit less' regularity than 
tp. In the special case where ip is a '2-symbol' potential, the Gibbs measure 
/J.^, is nothing but a Markov measure and the amalgamation it defines a hidden 
Markov chain. In this particular case, our theorem can be recast by saying 
that a hidden Markov chain is a Gibbs measure (for a Holder potential). 
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1. Introduction 



From different viewpoints and under different names, the so-called hidden Markov 
measures have received a lot of attention in the last fifty years [3J. One considers 
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a (stationary) Markov chain (X n ) ng N with finite state space A and looks at its 
'instantaneous' image Y n := n(X n ), where the map tt is an amalgamation of the 
elements of A yielding a smaller state space, say B. It is well-known that in 
general the resulting chain, (y n )„ e N, has infinite memory. For concrete examples, 
see e. g. [I] or the more easily accessible reference [3] where they are recalled. 

A stationary Markov chain with finite state space A can be equivalently defined as 
a shift-invariant Markov measure fi on the path space A N (of infinite sequences of 
'symbols' from the finite 'alphabet' A), where the shift map T : A N — » A N is defined 
by (Ta)i = Oj+i. A hidden Markov measure can be therefore seen as the the 
pushforward measure o tt" 1 on the path space -B N formed by the instantaneous 
image under the amalgamation %, of paths in A^ . 

In the present article, instead of focusing on shift -invariant Markov measures, we 
consider a natural generalization of them. Let tp : A N — * K be a 'potential', then, 
under appropriate regularity condition on tp (see more details below), there is a 
unique so-called Gibbs measure associated to it. It is a shift-invariant proba- 
bility measure on A^ with remarkably nice properties. Each r-step Markov measure 
falls in this category, since an r-step Markov measure is nothing but a Gibbs mea- 
sure defined by a (r+ l)-symbol potential, i.e., a potential tp such that ip(a) = tp(a) 
whenever a,i = hi, i — 0, . . . , r, with r a strictly positive integer. On the other 
hand, given tp one can construct a sequence (tp r ) of (r + l)-symbol potentials 
(uniformly approximating tp) such that the sequence of associated r-step Markov 
measures converges to li^ (in the vague or weak* topology, at least). 

Now let B be the alphabet obtained from A by amalgamation of some of the 
symbols of A. The amalgamation defines a surjective (i.e., onto) map tt : A — ► B 
which extends to ^4 N in the obvious way. Given a Gibbs measure on A n , this 
map induces a measure o it -1 supported on the full shift B N . The question we 
address now reads: 

Question 1. Under which condition is the measure Li^p o n , supported on the 
full shift B N , still Gibbsian? In other words, under which conditions on tp can one 
build a 'nice' potential <fi : _B N — > K such that o tt^ 1 = fj,^,? In particular, for tp 
a 2-symbol potential, what is the nature of o ir^ 1 ? 

In this article we make the following answer (made precise below, see Theorems 13. II 
and HI]): 

Under mild regularity condition on tp, the pushforward of the Gibbs 
measure ii^, namely /i^,o7r _1 , is Gibbsian as well, and the associated 
potential (p can be computed from tp. Furthermore, when tp is 
a 2-symbol potential, the corresponding hidden Markov chain is 
Gibbsian, and it is associated to a Holder potential. 

A slightly more general problem is the following. Suppose that we do not start with 
the full shift A N but with a subshift of finite type (henceforth SFT) or a topological 
Markov chain X [9] . The image of X is not in general of finite type but it is a sofic 
subshift [5]: 

Question 2. When X C A N is a SFT, is the measure ll^ o ir^ 1 still Gibbsian? 



The case r = corresponds to product measures (i.i.d. process). 
'We assume B has cardinality at least equal to two. 
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Question [2] has only received very partial answers up-to-date. We shall comment 
on that in Section [5l 

The present work is motivated, on the one hand, by our previous work in [5] in 
which we attempted to solve Question [5] and were partially successful. On the 
other hand, it was motivated by [5J where we were interested in approximating 
Gibbs measures on sofic subshifts by Markov measures on subshifts of finite type. 
Here we combine ideas and techniques both from [5] and [B] but we need extra work 
to get more uniformity than previously obtained. 



Let us mention two recent works related to ours. In [llj . another kind of transforma- 
tion of the alphabet is considered, and the method employed to prove Gibbsianity 
is completely different from ours. In [7], the authors study random functions of 
Markov chains and obtain results about their loss of memory. 

The paper is organized as follows. In the next section we give some notations and 
definitions. In particular, we present the weak* convergence of measures as a pro- 
jective convergence and we define the notion of Markov approximants of a Gibbs 
measure. In Section [3l we state Theorem 13.11 which answers Question [1] when the 
starting potential ip is Holder continuous (its modulus of continuity decays expo- 
nentially to 0). The proof relies on two lemmas which are proved in Appendices 16. 21 
and 16.31 respectively. In Section [H we generalize Theorem 13.11 to a class of poten- 
tials with subexponential (strictly subexponential or polynomial) decay of modulus 
of continuity. We finish (Section [5]) by discussing Question [2] and giving a conjec- 
ture. Appendix 16.11 is devoted to Birkhoff 's version of Perron-Frobenius theorem 
for positive matrices, our main tool. 

We have greatly benefited from the careful reading of an anonymous referee whose 
valuable comments helped us to improve the paper. 



2.1. Symbolic dynamics. 

Let A be a finite set ('alphabet') and A K be the set of infinite sequences of symbols 
drawn from A. We define N to be the set {0, 1,2,.. .}, that is, the set of positive 
integers plus 0. We denote by a, b, etc, elements of A N and use the notation 
(m < n, m, n S N) for the word a m a m+ i ■ ■ ■ a n -xa n (of length n — m + 1). We 
endow A N with the distance 



The resulting metric space (^4 N ,c?4) is compact. 

The shift transformation T : A N — > A N is defined by (Ta) n = a n+ \ for all n £ N. 
A subshift X of A N is a closed T-invariant subset of A N . 

Given a set of admissible words C C A for some fixed integer I > 2, one defines a 
subshift of finite type Ac C A^ by 



2. Background Material 




A c := {a G A : a 



n+l-l 



e C, Vn e N}. 
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A subshift of finite type defined by words in C C A 2 is called a topological Markov 
chain. It can be equivalently described by the transition matrix M : A x A — ► {0, 1} 
such that M(a,b) = Xc{ab), where is the indicator function of the set C. We 
will use both Ac and Am to denote the corresponding subshift of finite type. 

Note that the 'full shift' (A N ,T) can be seen as the subshift of finite type defined 
by all the words of length I, and we have the identification A N = A^i. 

Let X C A n be a subshift. A point a £ X is periodic with period p > 1 if T p a = a, 
and p is its minimal period if in addition T k a ^ a whenever < k < p. We denote 
by Per p (X) the collection of all periodic points with period p in X, and by Per(X) 
the collection of all periodic points in X, i.e., Per(X) = Up>iPer p (X). 

Given an arbitrary subshift X C A N and m £ N, the set of all the X -admissible 
words of length m + 1 is the set 

X m := {to £ ,4 m+1 : 3(iel,«) = a™}. 

It is a well known fact that a topological Markov chain Am is topologically mixing 
if its transition matrix M is primitive, i.e., if and only if there exists an integer 
n > 1 such that M" > 00. In this case, the smallest of such integers is the so called 
primitivity index of M. 

For a subshift X C A N , w £ X m and m £ N, the set 

H :={ael: c^" = w} 

is the cylinder based on w. 

We will use boldfaced symbols a, b, etc, not only for infinite sequences but also for 
finite ones (i.e., for words). The context will make clear whether we deal with a 
finite or an infinite sequence. 



2.2. Thermodynamic formalism. 

For a subshift X C ^4 N , cylinders are clopen sets and generate the Borel cr-algebra. 
We denote by M(X) the set of Borel probability measures on X and by Mt(A) the 
subset of T -invariant probability measures on X. Both are compact convex sets in 
weak* topology. The weak* topology can be metrized [2] by the distance 

Dfau) :=f>-(™ +1 )( £ Mw}-v[w}\). 

m=0 \w£X m ) 

It turns out that the following notion of convergence is very convenient in our later 
calculations. 

Definition 2.1. We say that a sequence ([i n )neN of probability measures in JVl(X) 
converges in the projective sense to a measure /i £ M(A) if for all e > and N > 1 
there exists N' > 1 such that 

Hn[w] 

exp(-e) < — < exp(e) 
/or admissible words w of length k < iV, and /or n > iV'. 



^On the other hand, if none of the rows or columns of M is identically zero, Am is topologically 
mixing implies M is primitive. 
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It is easy to verify that convergence in the projective sense implies weak* conver- 
gence. On the other hand, when all the measures involved share the same support, 
weak* and projective convergence coincide. Though it is the case in this paper, we 
will speak of projective convergence. 

We make the following definitions. 

Definition 2.2 ((r + 1) -symbol potentials). A function ip : A N — > M will be called 
a potential. We say that a potential ip : A N — > R is an (r + 1) -symbol potential if 
there is an r € N such that 

tp(a) = ip(b) whenever a r Q = 6q. 

Of course, we take r to be the smallest integer with this property. 

We will say that ip is locally constant if it is an (r + 1) -symbol potential for some 

r e N. 



A way of quantifying the regularity of a potential ip : A N — * R is by using its 
modulus of continuity on cylinders, or variation, defined by 

var„V := sup{|^(a) - ip(b)\ : a, be A N , aft = b^}. 

A potential ip is continuous if and only if vax n tp — > as n — > oo. An (r + l)-symbol 
potential ip can be alternatively defined by requiring that vax n ip = whenever 
n > r, and thus it is trivially continuous. If there are C > and g €]0, 1[ such that 
v&r n (p <Cg n for all n > 0, then ip is said to be Holder continuous. 

We will use the notation 

n-l 

S n ip(a) :^^T l (a), n = l,2,... 

k=0 

Throughout we will write 

i^wC" for CT 1 < - < C 

y 

for x, y and C strictly positive numbers. Accordingly we will use the notation 
x ^ y exp(±C). We also write x y ± C for — C < x — y < C. 

We now define the notion of Gibbs measure we will use in the sequel. 

Definition 2.3 (Gibbs measures). Let X C A N be a subshift and ip : A N — > R be a 

potential such that ip\x is continuous. A measure /i € Mt(A) is a Gibbs measure 
for the potential ip, if there are constants C = C(ip, X) > 1 and P = P(ip, X) £ R 
such that 

m fMl <r ±1 

1 j exp(5„ +1 V(a) - (n + 1)P) > 

/or all n € N and a G A. FFe denote by fj^/, such a measure. 

The constant P = -P(V ; , X) is the topological pressure [8] of A with respect to "0. It 
can be obtained, for A a subshfit of finite type, as follows: 

(2) P(il>,X) =limsupilog V exp(S„V(a)). 
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We will say that the potential ip is normalized on X if P(ip, X) = 0. We can always 
normalize a potential ip by replacing tp by ip — P(ip,X). This does not affect the 
associated Gibbs measure /i^,. 

In the above definition, we allow that ip — — oo on v4 N \X. In other words, ip is 
upper semi-continuous on v4 N . 

Remark 2.1. If ^ E My(J) is such that the sequence (\og(^[a^\/ M[ a iD)^Li con ~ 
verges uniformly in a E X , then the potential ip : X — > R given by 



is continuous on X, and [i is a Gibbs measure with respect to ip, i.e. /i = /!,/,. 
Furthermore, ip is such that P(ip) = 0. 

Notice that /x[aQ]//i[a™] is nothing but the probability under /i of do given a™. 
Therefore, by the martingale convergence theorem the sequence (log(/i[ao]//i[a™]))^° = 
converges for [i-a. e. a E X. The uniform convergence is what makes [i a Gibbs 
measure. 

We have the following classical theorem. 



Theorem 2.1 ([10j). Let X C A N be a topologically mixing subshift of finite type 
andip :X // 



n=0 

then there exists a unique Gibbs measure [i^, i.e., a unique T -invariant probability 
measure satisfying (p}. 

Remark 2.2. By this theorem we have a partial converse to §3§ in the sense that 
there the potential is defined by the measure, while in the theorem it is the potential 
which defines the measure. 

Notice that the uniqueness part of the theorem is granted by the Gibbs inequality {T]), 
since two measures satisfying it have to be absolutely continuous with respect to each 
other. It is the existence part which is nontrivial. 

For a proof of Theorem l2.1l see e.g. (SJ. This includes the case of Holder continuous 
potentials treated in, e.g., [21 [32]. 

2.3. Markov measures and Markov approximants. 

Markov measures can be seen as Gibbs measures. Colloquially, an r-step Markov 
measure is defined by the property that the probability that a n = a £ A given dp -1 
depends only on a™I*-0 What is usually called a Markov measure corresponds to 
1-step Markov measures. On the full shift, the case r = gives product measures. 
A T-invariant probability measure is an r-step Markov if and only if it is the 
Gibbs measure of an (r + 1)— symbol potential. Given an (r + l)-symbol potential 



We assume that r > 1. The case r = corresponds to an i.i.d. process, in which case the 
Gibbsianity is evident. 



(3) 




(4) 
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ip, which we identify as a function on A r+1 , one can define the transition matrix 
Mip : A r x A r M.+ such that 

MJv,w) := I exp(VK^-i)) ifv[-\=wZ- 2 , 
^ [0 otherwise. 

By vw r _i we mean the word obtained by concatenation of v and w r _i (the last 
letter of w). 

By Perron-Frobenius Theorem (cf. Appendix 16. ip there exist a right eigenvector 
Rtj, > such that J2 a eA r -^V>( a ) = 1) an d a eigenvector > such that 
L^Rip = 1, associated to the maximal eigenvalue < p^ := maxspec(Al^). Then 
the measure p defined by 

(5) M K] := L^ 1 ) J+1 ' M<-r+l), 

for each a £ A N and n € N such that n > r, is easily seen to be a T-invariant 
probability measure satisfying |T]) with 

P = log(p,) and C = $ e-rm ^U^)R^) : «g, t,' | ^ 

where ||?/>|| := sup{|-0( a )l : a, £ A N }. Therefore p = p^ is the unique Gibbs 
measure associated to the (r + l)-symbol potential i/>. 

Markov and locally constant approximants. 

Given a continuous ip : A N — > M, one can uniformly approximate it by a sequence of 
(r + l)-symbol potentials ip r , r = 1, 2, . . ., in such a way that \\ip — tp r \\ < var r (^>), 
which goes to as r goes to 00. The Vv's are not defined in a unique way but this 
does not matter since the associated r-step Markov measures p^ r , approximate 
the same Gibbs measure p^. We can choose ip r (a) := max{^(b) : b G [«□]} f° r 
instance. 

The potential ip r will be called the (r+1) -symbol approximant of tp and the associ- 
ated j — step Markov measure p^ r will be the rt/i Markov approximant of /i^,. ft is 
well known (and not difficult to prove) that p^ T converges in the weak* topology 
to p^. 



3. Main Result 



The next theorem answers Question [T] when tp is Holder continuous (Theorem I3.1|> . 
For the sake of simplicity we discuss the generalization of that theorem to a class 
of less regular potentials (i.e., var n (V>) decreases subexponentially or polynomially) 
in Section [4] 

Amalgamation map. Let A, B be two finite alphabets, with Card(A) > Card(B), 
and 7T : A — > B be a surjective map ('amalgamation') which extends to the map 
7T : A N — > £> N (we use the same letter for both) such that (7ra)„ = Tr(a„) for all 
neff. The map it is continuous and shift-commuting, i.e., it is a factor map from 
A N onto B N . 
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(6) 4>(b) — lim lim log 



+00 n — >oo 



Theorem 3.1. Let ir : A N — > i? N be the amalgamation map just defined ano 
i/j : A N — > K 6e a Holder continuous potential. Then the measure fM/> ° is c 
Gibbs measure with support B^ , for a potential (j> : B N — > M smc/i i/iai 

var n (</>) < 2?exp(— cVn) 
/or some c, I? > 0, and all n £ N. 

Furthermore, this potential <j> : B N — > R is normalized and it is given by 

>Vv Q7r- 1 [boY 
v MVv t 1 " -1 [ & 1 ] 
where ip r is the (r + l)-symbol approximant of ip. 
If ip is locally constant, then for all n 

var„0) < CtT 

w/iere ■& e]0,l[, C > 0. 

The case of locally constant potentials in the theorem can be rephrased as follows: 
When ity is an r-step Markov measure, with r > 0, the pushforward measure ji^ o 
7T _1 , i.e. the hidden Markov measure, is a Gibbs measure for a Holder continuous 
potential <f> given by 



(7) m= 1™ log, 

The case r = is trivial: the Gibbs measure is simply a product measure and its 
pushforward is also a product measure. 



The proof of Theorem 13.11 relies on the following two lemmas whose proofs are 
deferred to Appendices 16.21 and 16.31 

Lemma 3.1 (Amalgamation for (r+1)— symbol potentials). The measure /Xi/, r o7r , 
with r > 0, is a Gibbs measure for the potential <fi r : B N — > R obtained as the 
following limit 

(8) Mb) := lim log ff^{|j 

Furthermore, there are constants C > and € [0, 1 [ such that, for any positive 
integer n > r and for any b £ J3 N we have 

Hr "" _1 [ b o] 



(9) 



•(b) " log 



M«/v n 1 IK] 



< C'i 



Lemma 3.2 (Projective convergence of Markov approximants) . The sequence of 
measures {[i^ r ) converges in the projective sense to the Gibbs measure fj,^ associated 
to the potential ip. 

Furthermore, for all n, r > and w G A n , we have ^ r [w] S3 fJ-ip[w] exp(±e ri „) 7 
where 

00 

(10) e r ,n -=DJ2 i( n + ( s + !)( s + 2))var s ^ + s6» s ) , 

s—r 

/or adequate constants D > and # € [0, 1[ fi/ie same 6 as in Lemma \3.1\) . 
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With the two previous lemmas at hand, we can proceed to the proof of Theorem l3.ll 



Proof of Theorem 1 3. 1\ 

We start by proving that the sequence (//vj r oir~ 1 ) r converges in the projective sense 

tO [l^p O 7T _1 . 

On the one hand, Lemma l3. II tells us that the measure v r :— fx^ r o ir^ 1 is Gibbsian 
for the potential 4> r : B N — > R given by 

'M b oY 



.(6) = lim log 



.[&?; 



On the other hand, Lemma I3T21 ensures that for each n, r > with n > r, and each 
v £ A n , we have ^ r \v] ^ H$[v] exp(± e rj „) where e ri „ is defined as in (fTO) . From 
this it follows that for each w £ B n we have 



v r \w\ := 



VGA 71 •.TTV—W 



exp(±e rj „) ^2 Hi v \ 

VGA n :TTV — W 



(11) exp(±e r! „) o 7r [iu]. 

Otherwise said, the sequence of approximants [y r = //^ r o 7T" 1 ),,, converges in the 
projective sense to the induced measure o 7r _1 , and the speed of convergence is 
the same both the factor and the original system. 

Now we prove that the pushforward measure v := o ir^ 1 is a Gibbs measure. 
According to Lemma T3.ll and Eq. (fTT|) . for any 6 € B N , and n, r > with n > r, 
we have 



(12) 



-(6) -log 



< 2e r ,, 



Let us take, for each r > 0, n = n(r) := r 2 , and let r* > be such that both 
s i— > s 2 9 s and s > e s s 2 define decreasing functions in [r*,oo). Hence, using the 
triangle inequality we obtain 

\4> r (b) - 4> r '{b)\ < 2 (2e r , r 2 +Cr 2 9 r ) 

for all r* < r < r' , and for any b £ £? N . This proves uniform convergence of the 
sequence of potentials (0 r ) r . The limit is the continuous function 
defined by 



0(6) := lim log 



If we verify that satisfies condition then, according to the observation follow- 
ing Theorem l2.ll this will prove that v = fi^p o ir^ 1 is the unique Gibbs measure for 
0. From Ineq. (ITS]) it follows that 



|0(b) -0(6)| < 



+ 



m-Mb)\ 

(6) - log 



<M&) - log 



!/[&?] 



|0 r (&) - 0(6)| 



< 4(2e„2 +Cr 2 6i r ) + 2 (2e,,„ + Cr 2 r ) 
for all 6,6 £ B^ such that 6 £ [&£] , and every n > r > r* . 



10 



JEAN-RENE CHAZOTTES & EDGARDO UGALDE 



Since ip is Holder continuous and 

oo 

e nr 2 := D^2(r 2 + (s + l)(s + 2))var s ^> + s9 s ), 

s—r 

then there exist C > and g € [9, 1[ (remember that # € [0, 1[) such that max(e rl ,2, r 2 # r ) 
Cp r . We take again n = n(r) = r 2 and obtain, for all neN, 

va,r n cf> < 2?exp(— c-s/n) 

with a convenient V > 6C(2 + C), and c = — log(o). 

The case of a locally constant ■i/' is the immediate consequence of Lemma 13.11 and 
one has $ = 6* . 

The theorem is now proved □ 

Remark 3.1. The competition between the terms e r ^ n and Q n l r in the upper bound 
ofvax n (f> leads to a subexponential bound, namely var„</> < 2?exp(— cn 1 ^ 1 ), for any 
S > 0. We made the choice 8=1. 

4. Generalization to less regular potentials 

In this section we go beyond Holder continuous potentials and look at potentials 
ifi such that v&r r (i/)) decreases slower than exponentially. Besides the fact that 
^ r var r "0 < oo is always assumed, the only place where a finer control in the de- 
crease of var r (^i) is required, is inside the proof of Lemma l3.21 There, the projective 
convergence of the Markov approximants depends on the fact that 

oo 

e r .„ := £>y^((n + (s + l)(s + 2))var s V + s8 s ) -> 0, when r -> oo, 

s—r 

for each n > 0. Furthermore, the variation of the induced potential, var„</>, is upper 
bounded by a linear combination of and n6 n / r . After this consideration, we 

can generalize Theorem 13. II as follows. 

Theorem 4.1. Let ir : A N — > _B N be the amalgamation map just defined and 
ip : A N — > K be such that s2 var s ^ ^ °° - ^ e nieasure [i^ o 7r _1 is a 

Gibbs measure with support i? N /or a normalized potential <f> : B N — > M defined by 
the limit 

m = iim hm log r ^^:;;gn , 

r— >oon— >oo VM'/'r ^ [VI J / 

where ip r is the (r + 1)— symbol approximant of ip. 

Ifvax n ip has subexponential decreasing, i.e., ifva,r n -ip < C exp(— crt 7 ) /or some c, C > 
cmd 7 G]0, 1[, then there are constants D > C and < d < c such that 

var„(0) < Dcxp (—dn 1 ^^ 

for all n £ N. 

Ifvax n ip is polynomially decreasing, i.e., if var n i/> < Cn~ q , for some C > and 
g > 3, i/ien /or a/Z e G (0, q — 3) i/iere is a constant D > C such that 

1 
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for all n € N. 



Remark 4.1. As mentioned above, then-variation of the induced potential is upper 
bounded by linear combination of e r _ n and r 2 Q n / r . We have to optimize the choice 
of the function r i— > n(r) in such a way that n/r — > oo when r — > oo, and that the 
resulting n -variation of ip has the fastest possible decreasing. In the subexponential 
case, var„-0 < Cexp(— en 1 ), the optimal choice turns to be n(r) = r l+1 , while 
in the polynomially decreasing case, var„-0 < Cn~ q , the optimal choice is n(r) = 
r (<?- 1 )/(9- 1 - £ ) _ This gives a bound in n~ q+2+€ . 

5. Comments and Open Questions 



In our previous work [5] we made two restrictive assumptions, namely that ip is 
a locally constant potential and the image of the starting SFT under the amalga- 
mation map 7r is still a SFT (in general it is a sofic subshift). In that setting, we 
could prove, under sufficient conditions, that pi^ o tt^ 1 is a Gibbs measure for a 
Holder continuous potential <f>. We also exhibited an example showing that one of 
our sufficient conditions turns out to be necessary in that otherwise the induced 
potential (j) is not defined everywhere. 

We conjecture the following: Let 7r : A — > B be an amalgamation map as above, 
X C A N a SFT and Y C B N the resulting sofic subshift. Then the pushforward 
measure of a Gibbs measure for a Holder continuous potential is a "weak" Gibbs 
measure jj,^ in that ([I]) does not hold for every a but for almost all a (w.r.t. (i^). 



6. Proofs 



6.1. Preliminary result: Birkhoff's refinement of Perron Frobenius The- 
orem. 

Let E, E' be finite sets and M : E x E' — > M + be a row allowable non-negative 
matrix, i.e., a matrix such that Mx > whenever x > 0. Let us define the set 



A, 



lxe]0,l[ E : |x|i :=5>(e) = ll 

I eeE J 



and similarly Ae>- We supply with the distance 

x i \ i x ( e )y(f) 

d B {x,y) := max log———-. 

e,/6E x(f)y(e) 

On A^v we define Jg/ accordingly. Let us now define 



1 - JMM) 
t(M) := V V ' 

1 + y/$(M) 

where 

M(e,e')M(f,f) 



if M > 0, 



4>(M) := J e ^E,e',/'GB< M(e,f)M(f,e') 

I otherwise. 
Here M > means that all entries of M are strictly positive. 
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Theorem 6.1 (After Garrett Birkhoff). Let M : E X E' ->• R+ be row allowable, 
and Fm ■ Ae> — ► be such that 

Mx „ , . 
FmX :— —— . tor each x G Ae>- 
\Mx\i 

Then, for all x, y £ Ag/, we ftave 

S E (F M x,F M y) < T[M)8 E >(x,y). 
We have t(M) < 1 if and only if M > 0. 

For a proof of this important result, see [I] for instance. It can also be deduced 
from the proof of a similar theorem concerning square matrices which can be found 
in [T3]. As a corollary of this result we obtain the following form of the Perron- 
Frobenius Theorem. 

Corollary 6.1 (Enhanced Perron-Frobenius Theorem). Suppose that M : E X 
E — > R + is primitive i.e., there exists IeN such that M e > 0. Then its maximal 
eigenvalue pM is simple and it has a unique right eigenvector Rm £ Ae, and a 
unique left eigenvector Lm satisfying L m Rm = 1. Furthermore, for each x € 
and each n GN we have 



M"x ^ (L^x)^i? M exp ( ± a \ ' J" y r 
with t := t(M £ ) < 1. 



Proof. Let us first remark that -Fj^ = fjj- Since M > 0, then Theorem 16.11 
and the Contraction Mapping Theorem imply the existence of a unique fixed point 
xm = FmXm € Ae such that 

oo 

6 E (F M x,x M ) < Y,6E(FM +H x>K +(k+1)e x) 

< Se(x,F m x)t^/^ < ^ g ( a! ,F Ma! )rL"/ < J 

1-T ~ 1-T 

for each n £ N and x € A^. From the definition of projective distance it follows 
that, for each x G A E and n GN there is a constant C — C(x, n) such that 

(13) M n x § C(x, n) x M exp (^ ^(x.Fmx) ^ \ 

Let us now prove that xm = Rm G A^ is the unique positive right eigenvector 
associated to the maximum eigenvalue pu '■= maxspec(M). Indeed, since FmXm — 
xm, then Mxm = Axm for some A > 0. Now, if My = Xy for some y £ <C E , and 
taking into account that M is a real matrix, then y = a z for some a € C and 
z 6 Ae- Therefore A is a simple eigenvalue. It follows from Theorem 16.11 and the 
contraction mapping theorem that z = Rm is the associated eigenvector. 
Consider the map x h- > mm ee E{Mx)(e)/x(e) on Ae, and extend it to cIos(Ab) 
(the closure is taken with respect to the euclidean distance), by allowing values 
in the extended reals 1 := 1U {°°}- The resulting transformation is upper 



'Here we are following a standard argument which can be found in 1 131 for instance. 
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semicontinuous, and therefore there exists Xo G clos(A£:) attaining the supremum, 
i.e., such that 

. (Mx)(e) . (MxoXe) 
p := sup mm — — = mm — — . 

x eA E x(e) eeE x (e) 

This supremum is an eigenvalue, and the point where it is attained is its correspond- 
ing positive eigenvector. Indeed, if Mxq ^ px$, i.e. if (Mxo)(e) > pxo(e) for some 
e G E, then M 1+1 Xq > pM e xo which implies that p < sup xeAE min e& E(Mx)(e)/x(e). 
Therefore Xq is a non-negative eigenvector for M, but since M e x Q = p e x > 0, 
then necessarily Xq = Rm and A = p. 

Finally, if 7^ y G C E is a right eigenvector of M, associated to another eigenvalue 
A' G C, then 

\X'\\y\ = \My\<M\y\, 

where \z\ denotes the coordinatewise absolute value of the vector z G C E , and the 
inequality holds at each coordinate. If |A'| < min eS £;(|(My)(e)|)/(|y(e)|), we can 
find a vector y + G by slightly changing \y\ at coordinates e G E where y(e) = 
and then normalizing, so that |A'| < min ee £;(Afy + )(e)/y + (e). If on the contrary 
|A'| = mm eeE |(My)(e)|/|y(e)|, then M e+1 \y\ > |A'|M £ |y|, and normalizing M l \y\ 
we obtain y + G A# such a way that |A'| < min e6 £;(My + )(e)/y + (e). We conclude 
that, 

■ (Mx)(e) 
A < sup min- := p 

for each A' G spec(M), therefore p = pu '■= maxspec(M). 

It remains to prove that in Ineq. (fT3|) . we have C{x, n) — (L\ { x) p 7 ^, where Lm > 
is the left eigenvector associated to pm, normalized so that L m Rm = 1- For this 
note that, by multiplying Ineq. (|13p at left by Lm, we obtain 

t -<n\ sC (T.t - fi,A H(<r ^ m (JS E (x,F M x) ln/e} 



□ 



p n M {L M x) ^ (Ll I R M )C(x,n) exp \± > r 
hence C(x, n) — p 1 m (L' m x)/(L' m Rm), and the proof is finished 



6.2. Proof of Lemma [HU 



6.2.1. The right eigenvector. 

Notice that the transition matrix M. r := M.^ r is primitive with primitivity index 
r, hence, according to Corollary 16. II 

M n r x ^ (Ztx)p?R r exp ( V^y^ rL^ , 

for each x G and n G N. Here p r denotes the maximal eigenvalue of Ai r , 
R r G A^r its unique right eigenvector in the simplex, L r its unique associated left 
eigenvector satisfying L\R r = 1, and r := r(jM^) denotes the contraction coefficient 
associated to the positive matrix M r r . 
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Let us now obtain explicit an upper bound for t and for the distance SA r (x, F r x) 
for particular values of x g Ag. First, 

> mm — -J rfirTT -( > exp -2 ) var fc ^ > 0. 

u,v, u ',v>,u",v"eA r M r (u, v')M r {v,u") V y 

Therefore r < 1 - e" S^=o var ^ and (1 - r)" 1 < e^J=o var ^. 

Let := X^fcLo var fcV'j an d :=1 — e~ s '* . With this, and taking into account the 

upper bound for r and (1 — r) _1 , we obtain 



(14) M> ^ (L\.x)fiR r exp (±r^(£c, i^) e s <" (9 LrJ 

On the other hand, for u := (l/(Card(A r )), . . . , l/(Card(A r )))t g A A r, we have 

, „ _s , / u{w')(M r u)(w)\ 

6 A r{u,F r u) := max log 

™,™'eA'' \u(u;)(.M r u)(t(r ) / 

< r log(Card(A)) + 2||^|| < rC , 
where Co := 2 (log(Card(A)) + ||^|| ). Therefore, by taking x = u and n = r 2 



in (|T4|) . we finally obtain 

e S r 2_ r _ 1 ip r (a) 

(15) R r (u) < t^^^yp e ±C„r a cxp(«*)«". 

Pr \Lr\l 



6.2.2. Ansatz for the induced potential. 

To each word w £ B r we associate the simplex 

A w :=\xe (0,1) B - : |x|i := 



H ^ = 1 f 



where E w := {« 6 # : 7rt> = it)}. 

Let M r ,Pr,L r ;= L^ r and _R r := i2^, r be as above, and define, for each w g B 7 ', 
the restriction L rw :— L r \^ w € (0,oo) Ew . Define R r ^ w in the analogous way, 
and for each to g B r+1 let M. r ,w be the restriction of M. r to the coordinates in 
x Using this, and taking into account Eq. ([5]), which applies to our 

(r + 1)— symbol potential ip r , we derive the matrix expression 



7ra o= b o y pr y 

for the induced measure v r := fi^ r o ir^ 1 . It follows from this that 



V 



(ir,6 I ) T n"=i r ^ r ,^+-^b»_ r . +1 



- log(p r ). 



For each «; g A r+;i , with s > 1, let Ai r ,w '■= IIi=o j+r > an d define the 
transformation F r w : A w r+ s -i —> A w r-i such that 

f rln X 



\M r , w x\l 
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For each b £ B N and s, t £ N, let 

X b s+t+r := F b s+r+l O • • • O F ,,+t+r ( R ,+t+r /\R tf+t+r \ X ) £ A^ + r- . 

By convention, a; = R h s+r/\R„, »,s+r|i £ A. s +r. Using this notation, and after 
the adequate renormalization, Eq. (fl6|) becomes 



(17) log = ^ 1 ' r* - - I - log(*). 



6.2.3. Convergence of the inhomogeneous product. 

Let us now prove the convergence of the sequence ( x b™) n>r - For this notice that 



XrA™ ■— F r b r+i o • • • o F r ,6j»_ r X r ,6j 



-7-+1 



- F r>6?! r r ,6»_ r+1 - F r b 2r O F r>6 3r x O • • • O K,b$t?f r+1 Xr ' b *r+l> 

where k := I— J — 1. Now, since M. r , w > for each to G B 2r , then Theorem 16.11 
ensures that the associated transformation F r w : A^r-i — > A w r-i , is a contraction 

with contraction coefficient r w = (1 — V&w)/ (1 + v 7 ^uT), where 

A^ r ,«j(w,«')-^r,t«(w,u') 
*™ := mm , . ' 7 n ' ; 77 

i^eE^-i , u',u'e,E^2r-i A4r lTO (r, 'U').M r . tl ,('U, tr) 
M r ,w(v,v')M r ,w{u,u") 

> mm 1 1 

u,ti,» , ,ii , ,i;",M"€yl r A^ r ,U}('U, tl')A4 r ,iu (u, f") 

(18) > exp ( -2^var fc ?/> ] > e~ 2s * > 0. 

V k=o J 

Recall that s^p — X}fc=o var fcV'- From Ineq. (|18[) we obtain a uniform upper bound 
for the contraction coefficients, t w < 9 := 1 — cxp(— s^,) < 1, which allows us to 
establish the uniform convergence of the sequence { x b™) n>r with respect to b £ B N . 
Indeed, for b £ B N fixed and m > n, we have 

(19) (xb?,x K ) < 9 k S E (x rKr+i ,x r , K \ 

fcr+1 X ' 

where k := [^\ — 1. On the other hand, 

k' 



3=0 



"fcr+1 



where k' := \_(m — n)/r\ — 1. By convention, when fc' = —1, the summation in the 
right-hand side is zero. Then, since all the matrices M r ,w are row allowable and 
positive for w £ B 2r , then we have 

(20) S E (x r , b n x r>bf ) <T 1+ T 2 + T 3 

"kr+l V ' 
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where 

(21) T< :=<%,„ [>>--rb»_ r+1 ,F rX+ r i X rib n J 



_ r+ l \ ' n-r+x- M«„_ r+1 



(22) T 2 :=^6t8 E (* rV +? ,F, b „ +u+1 >, x n+ u + ,>\ 



and 



( 23 ) T 3 : = *V«^ (^^•^^av.as-H.,) 

n + k'r+l V ™ + r+1 / 



Once again, we convene that T2 = if k' = — 1. 

Now, for each w,w' 6 _B r , and v <E B s with r < s < 2r, and such that = 
to, uf_ r+1 = to', we have 



^■u* (•Ej^tw; ^r,V'^r,w J ) max log 



^.(u) {M r ,vRr,w'){u') 



< max log . 

u,u'tE w \R r {u') {A4 r ^R r ^ W ')(u) 

Hence, using the estimate for the right eigenvectors given in Eq. (|15p . it follows 
that 



&w ip^r,w i F r v x r w ' ) 5^ max log 



o£Pcr r 2 (A N ) aePer r 2 + s _,.(A ?i ) 



min e S - 2 — ^" (a) min e S' r 2 +8 _ 2r _ 1 ^(a) 

aePcr r2 (A N ) a£Pcr r2+s _ r (A N ) 



+ 2r 2 C e^ 6 r < 2r(r+ l)C (e s * 6 r + 1), 

with C = 2(log(Card(j4)) + ||V>||) and = 1- exp(-s^) as in Eq. fig]). Using this 
upper bound in ([2T]). l[22]) and p3|). we obtain from ([2U|l 



^ (W,6 L+1 , av,6 ?r+1 ) < 2r(r + 1) C (e s * (T + 1) (> + ^—^ , 
and with this, Ineq. (fl"9|) becomes 

(24) (x 6r ,x 6r ) <2r(r + l)Co(e 5 *^ + l)0LfJ- 1 fflL* J-i + _L_j 

which holds for all b € £? N and r < n < m. Hence, (xf,") n >r is a Cauchy sequence in 
complete space Aj,r, and the existence of the limit x^ ;= lim IJWOO x^™, is ensured 
for each b S £? N . Furthermore, from Eq. (|24[) it follows that 
(25) 

fe 6I (a5 br ,x 6r ) <2r(r + l)C (e s *9 r + 1)9^~ 1 WJ-i + <Cir 2 9^, 
with Ci := 4C (1 + e s * 0)/(6 2 (l - <?)). 



ON THE PRESERVATION OF GIBBSIANNESS UNDER SYMBOL AMALGAMATION 17 



6.2.4. The induced potential and the Gibbs inequality. 
Taking the Eq. ([25]) . it follows that the limit 



(26) (j) r (b) = lim log 



log 



log(Pr), 



exists for each b G B N , and defines a continuous function b i— > 4> r (b). This proves 
that the limit ([8]) in the statement of the lemma does exist. It remains to find an 
upperbound to its modulus of continuity. 

Inequality ([23]) . and the fact that |a?b«|i = l^b^li = 1' imply that 

x b ™ ^ x bT exp (± C\r 2 d - ) 

for all b 6 i? N and n > r. With this, and taking into account Eqs. (117|) and (|26|) . it 
follows that 



•(b) -log 



< 



log 



(L r ,b^y -Mr^Xr^ 



log 



Xr.b 1 } 



(L r< bi) x r ^f 



for all b e S N , n > r and C := 2Ci = 8C (1 + <9)/(6> 2 (l - 9)). This proves © 
in the statement of the lemma. 

From this it can be easily deduced that v r = ju,^ r o 7r _1 satisfies the Gibbs Inequal- 
ity |T]) with potential <j) r and constants P(4> r , B n ) = and 



CU r ,B N ) = max 
beB" 



exp(S r 2<j) r 


(6)) 




i r 2 
b 





exp(S r 2(f> r (b)) 



exp 



CV 2 6» r 



This proves the first statement of the lemma the proof of which is now complete. 
□ 



Remark 6.1. As mentioned above (see ©J, the topological pressure of?p is given 
by 



P(4>)^P(^A N )= lim - log ( V 

\aePer„(A Fi 



Since ^> ^ i/v ± va,i r ip, we get 



log(Pr) 



lim — log (Trace (M™)) 

n — >oo 77, 



^ lim - log V e s "^ a > ± var^ 

n^oo n I ^ — ' / 

\aePer„(A Fi ) / 



P (ip) ± var r ?/>, 



/or eac/i r € N. 



6.3. Proof of Lemma HOI 
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6.3.1. Periodic approximations. 

Each Markov approximant can be seen as the limit of measures supported on 
periodic points as follows. Fix n,r G N with n> r, and w £ A n . Then, for each 
p > r + n we have, 



w\ := 



gSp-0r(l) 

We can rewrite the above equation as 



qgPcr p (A w )n [tu] e 

EaePcr p (A N ) 1 



(nrx 1 w^ +r -\<ti)) <n_xM? 

7jWf«l = ^ 



with M r as above, and € {0, 1} A the vector with 1 at coordinate £ and zeros 
everywhere else. Now, since > f° r each k > r and ( e A r , then Corollarv l6.ll 

applies, and using §5§ we obtain 

Vi p) [w] ^ i °—L-± =, j-= '- 



e ± i-x 



n— r 

with R r ,L r ,p r and r := t(.M£) as before, and 

D := 2 max^r {M r r e^ M r r +1 ec) . 

Since r = (1 - \/$)/(l + \/$), with 

M r<w (v,v')M r , w {u,u') ( -A A 

* := mm , „ . TT~ — i \ a a ' 7 ^ ex P " 2 V var ^ > 0, 

(V, U )M r ,w \U, V ) \ f— ' / 

then r < 6> := 1 — exp (— s^), and (1 — t) _1 < exp (s^), with := X^fclo var fc^ as 
in Lemma |3. II On the other hand, 

D < 2 max log 1 



tf^u'ei^" VAlr +1 (M,t«')-^r(' u/ : W ') 

< 4 Mog(Cardpl)) + ^jvar^ + ||^||J 

< 4(log(Card(^))+s Vj + ||V||)=:-Di. 
Using this explicit bound just obtained, we deduce the inequalities 

V p S p tp r (a) 
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for each w 6 A n , and all p > n + r. It is easy to check that these inequalities extend 
to each w £ U% =1 A k , and we finally obtain 

V 1 „S p Tp r (a) 



(27) [w] % exp l±Dx r 

for all r, n £ N, and G A™ . 



6.3.2. Telescopic product. 

Let us now compare two consecutive Markov approximants. Fix n, r > 0, and 
w 6 A". Then, for each p > n + r + 1, Inequalities (f2"T)) ensure that 

r 1 V* f>Spil> r {a) V-i „S p i/v+i(a) 
MjvW 2^agPcr p (A M )nH 2^agPcr p (A w ) e / „ „EZ2. 

/; , r«>l > V pS p «/v + i(a) V P S p ipr(a) eXP I ° r 

Wv+iL" 7 ] 2^aep crp (AN)n [to] e 2^aePer p (A N ) e p v 

with q = max(r + 1, n) and C := 2e s ^' Z?i. Since Vv+i ;s Vv ± var r+ i?/>, then we 
have 



7. rTTTT > 



exp ( ± ^2p var r +iV> + C r P ^ ' 

*s exp ^± ^2 pvar r+ i^ + Cr 6* £ 7 

for all r € N, € U)J =1 A fe , and p > n + r + 1. Let p = (r + l)(r + 2) + n - 1, then 
for each r' > r e N and to S U^ =1 A fe we have 

^exp ( ±DY ((n+( S + l)( S + 2))var s ^ + S s ) ) , 
V ^ / 
with := max(2, C). Since -0 is Holder continuous and 6 e (0, 1), then 

oo 

e r ,„ :=J)V ((n + (s + 1)0 + 2))var s ?/; + s6» s ) -> when r -> oo, 



for each n,r e N. We conclude that, /x[tu] := lim^oo fj,^ r [w] exists for each 
w € U^ Q yl fe , and we have 

^ exp iflV ((|tu| + (a + + 2))var s ^ + s6 s ) j , 

for every r S N and w G Uj^-A*. 

6.3.3. The limit lim r _>ooMi/v * s ^ e Gi66s measure fj,^. 

It only remains to prove that p such that /x[tu] := linv^oo /x^ r coincides with the 
original Gibbs measure fi^. Note first that /i so defined is T-invariant. Indeed, it 
is the weak* limit of the sequence (/i0 r )r>i of T-invariant Markov approximants, 
it is a T-invariant probability measure as well. 

Now, replacing tp r by ip±vax r ip, and making p = (r+ l)(r + 2) + n— 1 in Ineq. (|27p . 
it follows that 



H[w] ^ IhA w \ exp(±e ri „) 



ePer p (A N )n [to] 



S p ip(a) 



2^aePer p (A N ) e P 



exp (± 2e r , n ) 
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exp (± 2e r ^ r 2^j 



2 



for each r 6 N and w 6 U^ =1 ^4 fe . On the other hand, the Gibbs measure /i,/,, whose 
existence is ensured by the fact that ^2 r var r i/; < oo, is such that 



and Ineq. that /i is absolutely continuous with respect to /i^. The Ergodic 
Decomposition Theorem implies that /x^, is the only ergodic measure entering in 
the decomposition of the invariant measure /i, therefore ft = /i^. 
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oo, it follows from this 
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